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1. Introduction 

The Unisys Enterprise NT Sizer provides system configuration recommendations lor Windows NT Database 
Server systems, based on customer workload requirements for applications running relational database- 
management systems (DBMS) in a client/server computing environment. The Sizer is a Windows application that 
runs within a Microsoft Excel application framework. Within this framework, the Sizer was designed to oiler a 
user friendly interface to accomplish complex tasks. 

This document describes the NT Sizer in the following manner: 

• Its current capabilities are summarized; 

• A user guide is provided; 

• The methodology behind the Sizer is described. 

2. NT Sizer Features 

The sizer is a Windows application that runs as a Microsoft Excel add-in. The sizer requires the Excel 97 version 
to run. The sizer was developed on a system using a 17" monitor with 1024 x 768 resolution and small fonts; 
although the sizer has been tested with smaller monitors and coarser resolution and font settings, it is possible that 
some dialogs may not be as easy to use in these cases. 

This tool is intended for use by analysts who will assist customers in selecting an NT system configuration that will 
meet the customed DBMS application and workload needs. Thus, it is designed to interact with the user to define 
the database, the application, the application workload and growth. The current version of the Sizer has the 
following features. 

2 A Comparator 

The TPC Comparator calculates ratios of performance and price/performance for published multi-vendor 
Transaction Processing Council Benchmark C (TPC-C) and Transaction Processing Council Benchmark D (TPC- 
D) measurements. The cases cover the UNIX and Windows NT 4.0 operating systems, as well as several different 
relational database management systems. 

The data used by the Comparator is first downloaded from the TPC WEB site !uini:^.>v.w,jPC,org and then 
processed to provide a user friendly interface to make quick comparisons of performance and price/performance. 

2.2 Size Pre-defined Workloads 

The Unisys Enterprise NT Sizer estimates system configuration requirements for Online Transaction Processing 
(OLTP) applications and workloads. These workloads may be pre-defined or user defined. Included in the set of 
pre-defined workloads is the TPC-C benchmark. Other specific, pre-defined applications will be included in ruture 
releases of the Sizer. 

2.3 Size User Defined Workloads 

The Sizer also allows the user to define applications and workloads according to the business case of the potential 
customer. The data gathered via the various dialogs for an OLTP system consists of: 

• Size of the database and its various indexes 

• A definition of the composition of each transaction in the application, e.g., number of SQL inserts, deletes, 
updates, and selects 

• The expected transaction rate, measured in transactions per second (TPS), by transaction 

• The processor, network interface card, and mass storage device utilization requirements 
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From this data the following system sizing information can be estimated: 

• Processor requirement, and expected processor utilization 

• Memory requirement 

• Mass Storage requirement 

• Number of users supported 

• Network Interface Card requirement and expected utilization 

Currently, estimates are available for applications using the SQL Server 6.5, SQL Server 7.0, and Oracle 8.04 
DBMS. The systems supported are the XR/6 and QS/2 Enterprise systems. 

Estimates given by the sizer are from the point of view of the NT Server only. Client configuration estimates are 
currently beyond the scope of this tool. 

2 A Mass Storage Sizing Only 

This feature, which is already incorporated into the process of system sizing for a user defined workload, allows tfo 
user to address mass storage requirements only. Two types of sizing are available. If the user has a very vague 
notion of the database definition, a rough estimation capability is provided, and is based on the input of only six 
parameters. If the user has more specific information about each table's size and the characteristics of its indexes, 
then a more detailed mass storage requirement estimation capability is available. In either case dialogs are 
provided to change some parameter settings which will be more suitable to the user's requirements. 

3. User Guide 

The NT Sizer installation diskette contains a single self-extracting zip file. To unzip, simply double-click on the 
file (NTSizer_280.exe) in Windows Explorer. A WinZip dialog box will appear, asking you to enter the path 
name where you would like the NT Sizer files placed. After entering the path name, click the Unzip button. 
When the unzip process is complete, a message box indicating that three files were unzipped will be displayed. 
Click OK, then click Close on the WinZip dialog box. The NT Sizer installation is now complete. 

Five files are required to run the NT Sizer. One is basically an executable which runs under the Excel framework. 
The other two support the functionality of the Sizer. More specifically, these files are the following: 

• NTSizer_280.xla, an Excel add-in file which contains the menu items, procedures and database required to 
perform a sizing; 

• Three templates which contain preformatted worksheets used by the Sizer to store information frot he user: 

- UserBk.xlt 

- Mass Storage Estimate.xlt 

- Mass Storage Estimate Detailed.xlt 

• Lic.xls, a password-protected workbook containing access information for the Sizer. 

All five files must reside in the same directory. 
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To start the Sizer, double-click on the add-in file, NTSizer_.280.xia from Windows Explorer, or start Excel and 
open the file using the File Open dialog box. The first time you use the Sizer, you will see the following license 
agreement. 

This license agreement will appear only once. Accepting the license agreement completes the installation. 
Declinina causes the Sizer to unload. 



UNISYS Enterprise Servei NT Size END-USER License 



END-USER LICENSE AGREEMENT FOR UNISYS Enterprise Server. NT Sizer SOFTWARE 

This UNISYS End-User License Agreement ("EULA") is a legal agreement between the end user and UNIS YS 
Corporation for the Enterprise Server NT Sizer, which includes computer software, associated media, printed 
materials, and "online" or electronic documentation ("NT Sizer"). By installing, copying- or otherwise using the 
NT SIZER, you agree to be bound by the terms of this EULA. Jf you do not agree to the terms of this EULA, .do 
not install or use the NT SIZER. 

NT SIZER LICENSE ' ^ " - 

The NT SIZER is protected by copyright laws and international copyright treaties, as well as other intellectual 
property laws and treaties. 

You may install and use one copy of the NT SIZER on a single computer. You may not reverse engineer, 
decompile disassemble, rent/lease; transfer, or lend the NT SIZER, UNISYS may. terminate this EULA if you , 
fail to comply with its terms and conditions. . / 



1 'AGREE TO T HE TERM5 OF THIS LICENSE 



I DO NOT AGREE TO THE TERMS OF THI5 LICENSE 



Each time the Sizer starts, a message box appears and provides the following information: 

• Brief summary of the Sizer's Functionality; the Sizer contains primarily two tools: the Enterprise Server NT 
Sizer for Windows NT Applications, and the TPC Comparator. The Enterprise NT Sizer estimates system 
configuration requirements based on the database size and transaction workload. The Comparator calculates 
ratios of published TPC-C and TPC-D results across several vendors, operating systems, and relational 
database systems. 

• Navigate through the sizer's functionality via the menu bars specially created for use with the sizer. 

• The Sizer's expiration date. The Sizer's Help menu bar directs you to contacts so that the Sizer's status can 
be upgraded. 

Click OK to close the message box. 

When the Sizer is opened, a copy of the workbook template UserBk.xit is loaded as UserBk/rxls where the value of 
n is dynamically assigned by the Excel program. This workbook acts as a storage area for system sizing results and 
input data where the writing of the information to this workbook is controlled by the Sizer. One may view how the 
data is stored by executing the macro that comes with Userbk.xls, MakeAHSheetsVisisble. This is done by 
selecting Tools, Macros. Note that Userbk«.xls contains several worksheets. The meaning and use of these 
worksheets will be discussed later. 
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3.1 The Sizer Menu 

The Sizer is operated bymakina selections from each of the three menu options added to the Excel menu bar: 
Sizer Menu, Workload, and Sizer Help. Note that all other Excel fanctionality is still available. Primary sizer 
functionality is initiated via the Sizer Menu. 
Selecting the Sizer Menu gives the following options: 

• Comparator 

- TPCC Comparator 

- TPCD Comparator 

• OLTP System Sizing 

- TPCC Workload 

- User Defined Workload 

• Mass Storage Sizing 

• Close Sizer 

• Exit Excel 

Selecting Close Sizer will cause the sizer add-in to unload from memory and the sizer menu to be removed. 
Selecting Exit Excel is the same as selecting File, Exit. 

The remaining subsections describe the capabilities associated with the Comparator, OLTP System Sizing, and 
Mass Storage Sizing features. 

3.2 Using the Comparator 

The TPC Comparator calculates ratios of performance and price/performance for published multi-vendor TPC-C 
and TPC-D measurements. The cases cover the UNIX and Windows NT 4.0 operating systems, as well as several 
different relational database management systems. 

The data used by the Comparator is first downloaded from the TPC WEB site hup: / www.tpc.oi-q and then 
processed to provide a user friendly interface to make quick comparisons of performance and price/performance. 
We also note that all of the ratings published at the TPC site are based on 100% processor utilization. 

Two tools are used to make comparisons. One is a comparison of metrics for the TPC-C benchmark, and the other, 
TPC-D benchmark. 



3.2.1 TPC-C Comparator 

To start the TPC-C Comparator, select Comparator , TPCC Comparator from the Sizer Menu. A dialog box, 
like the one below, will be displayed where you can choose a number of values for a baseline and target system. 
The Comparator will then calculate the ratio of tpmC's (transactions per minute) and S / tpmC (price per tpmC) o 
the target system to the baseline system. 
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TPC-C Conpaialoi 



Baseline System 

Operating System 

<" UNIX 

f Window: NT 4.0 
f~ Other 



System 



| Acer / Altos 19000?ro4 c/s 



Database Software 

C Inform* ^ SOL Server G. 5 

C Oracle 7 >C <~ SOL Server 7 0 

r Oracle S,W r 082 



~ Sybase 
r Other 



Piocessor 



Mhz No. CPUs tpmC . Price.'Perf Pale 



Pentium Pro/ 2 



200 4 11072 



28 



Target System 

r Operating System 

r unix - 

' & Windows 'NT 4 0 
r Other 

System 



Dalabate Software 

C Informix r SOL Server 6 5 T Sybase 

C Oracle 7.3K .jfOL Server 7 0! r Other 



r Oracle SX* ; <~ D62 



Ptocesscr 



Mhz No. CPUs ,,tpmC Piice/Perf -Date 



| Unisys / Aauanta QS/2 Server Pentium II Xeon/4 400 4 1815 4 



2b 



Target to Baseline Ratios — — 
tpmC: 1.64 



Price/Perl: 93 



Return 



~~3 



The two standard metrics used for comparison for the TPC-C benchmark are tpmC and S/tpmC. tpmC stands for 
transactions-per-minute-C; it is a measure of "business throughput" and represents the number of orders processed 
per minute. Five different transaction types are used to simulate the business activity of processing an order: New- 
Order, Payment, Order-Status, Delivery, Stock-Level. tpmC is the number of New Order transactions processed 
per minute. New Order transactions account for approximately 45% of the total TPC-C transaction workload. 
More information on TPC-C can be found on both the Performance Lab and TPC WEB sites [8,9]. 

To run the TPC-C Comparator dialog, first select the characteristics of the baseline system, i.e., the system to 
which you want to compare performance. The first choice is that of the Operating System, with the choices being 
Unix or Windows NT 4.0, or Other. The Other choice includes, for example, the IBM AS/400 Operating 
System. The second choice is that of the Database Software, where the following relational database management 
systems (RDBMS) are available: DB2, Informix, Oracle 7.3 and 8.0, SQL Server 6.5 and 7.09, Sybase, or 
Other. The Other choice includes proprietary RDBMS'. Once the operating system and RDBMS have been 
selected, a drop down list then shows which systems were tested and results subsequently published with the TPC- 
C benchmark measurement. Each line on the drop down list gives a summary system description plus the value of 
the New-Order rate (tpmC) and the corresponding price/performance estimate for the system measured. If the 
operating system and RDBMS choices are such that no values are available (e.g., SQL Server 6.5 on Unix), the 
drop-down list will show None. 

The comparison is completed by repeating the above process for the target system. When this is done, the ratios of 
tpmC and S/tpmC are shown at the bottom of the dialog. 

We note also that the Comparator responds in a dynamic manner to each change or choice by the user. Thus, no 
"re-calculation" request is required by the user for each change made by the user. 
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In the example shown above, TPC-C value for the Unisys Aquanta QS-2 system, configured with four (4) 400 
Intel Xeon processors is compared to that of an Acer system, configured with four 200MHz Pentium Pro 
processors. The tpmC Ratio shows that the Unisys system is yields a 60% higher throughput rate at 7% less c 



3.2.2 TPC-D Comparator 

To start the TPC-D Comparator, select Comparator , TPCD Comparator from the Sizer Menu. A dialog box, 
like the one shown below, will be displayed where you can choose a number of values for a baseline and target 
system. The Comparator will then calculate the ratio of QppD's (power metric), QthD (throughput metric), and S ' 
QphD (price/performance metric) of the target system to the baseline system. 



TPC-D Comparator 



Baseline System 

Operating System ■ 

• - r unix . 

Window* NT 40 

• r oibct 

System . ^ 



■■ Dal abate Software 

r Informix C SQL Server 6.5 <~ Sybase 

r Oracle 7.'K ■ ' r SQLServer 7.0 r Othet 



Processor 



Mhz Nodes. GPU^lode QppD QthD Piice/Peif Pale 



"3 



| HP ✓ HetServer LXr 8000 

Target System 
, ,/ , , Operating System . 

r unix . . . 

& Window* NT 4.0 
■ ' T Other . 

"System 



Pentium II Xeon 4 00 1 



1585 



519 513 



Database Software 

C Iriormut 
r Oracle 7. 3X 
- : fT Oracle aW 



O SQL Server 65 C Sybase 



r SQL Server 7.0 
r DB2 



r Other 



Processor 



Mhz Nodes CPU/N ode QppD QthD Piice.'Perf Date 



"3 



| Unisys / Aquanta <"'S/2 4 Way 



Pentium II Xeon 400 1 



1561 



530 357 



p Target to Baseline Ratios - 
QppD: .98 



QthD: 1.02 



Price/Perf: .69 



Return 



The power metric, denoted Opp].V<7\Sizc , is calculated as the product of the relative database size and the reciprocal 
of the geometric mean of the timing intervals. The relative database size is the scale factor used to determine the 
population of each table in the database. For example, a scale factor of 1 yields about 1 GB of raw data, etc. Thus, 

OmilVSi/c = 360Q * SF / [ (Q, * . * Q 17 ) * UF, * UF 2 ] A (1/19) 

where 

Q, = Elapsed time to run query i within a single query stream 
UFi = Elapsed time to run update function 1 
UF 2 = Elapsed time to run update function 2 
SF = Scaling Factor 
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Elapsed times are in seconds. The units of the power metric are queries per hour times the scale factor, thus the 
3600 multiplier. 

The throughput metric, denoted OihlK/Si/c , is calculated as the ratio of the number of queries executed to the 
length of the measurement interval, i.e., 

QthD&Si/c = (no. of query streams) * 17 * 3600 * SF/ (Length of measurement interval) 

Length of measurement interval is in seconds. The units of the throughput metric are queries per hour times the 
scale factor. 

The price/performance metric, denoted as S ; Onhi )^ -Size , is calculated as the ratio of the total system price to the 
composite query per hour rating which is the geometric mean of (JppD^Si/c and tjihl ) ^M/o, i.e., 

S/Oplulf«:Sizc = (total system price) / ( Ovv\~)(u : Sy/.c * (.JmP^Si/o) A ( 1/2) 

The units of the price/performance metric are dollars per (queries per hour times the scale factor). 

The operation of the TPC-D Comparator is identical to that of the TPC-C Comparator. Of course the exceptions 
are in the specific results. In this case we obtain the ratios of the power metrics, the throughput metric, and the 
price/performance metric. 

In the example shown above, TPC-D value for the Unisys Aquanta QS/2 system, configured with four (4) 400 Mhz 
Intel Xeon processors is compared to that of an HP system, configured with four (4) 400MHz Intel Xeon 
processors, configurations. The QppD andQthD ratios show that the Unisys system is within 2% of the HP system; 
however, the Uniys system costs approximately 30% less. 

3.3 Mass Storage Sizing 

A user may wish to determine only mass storage capacity requirements initially. This capability is provided via the 
Mass Storage Sizer feature. 

The calculations of mass storage requirements for databases are based on an analysis of the table and index 
structures for each of the DBMS supported by the sizer. The estimates of file size requirements to support these 
databases are based on both recommendations from the DBMS vendors as well as experience in the field. 

To estimate mass storage requirements select Mass Storage Sizing from the Sizer Menu. The dialog shown 
below is then displayed. 



New Sizing 



,■ Select source of input data 



From a previous sizing 



OK | Cancel | 
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The user selects whether this is a new mass storage sizing or tins is a carry over of a sizing that was previously 
saved to a workbook. 

If the user selects From a previous sizing, an open file dialog is returned and the user can then select the tile 
containing the results of a previous sizing. Note that the worksheet(s) in the previously saved workbook must have 
the same format as either of the two templates Mass Storage Estimate.xlt or Mass Storage Estimate 
Detailed.xlt. Upon selecting the file, the appropriate dialog is opened to modify the existing database definition, 
and make subsequent estimates. 

If the user selects New, the following dialog, Basis for Mass Storage Requirement Calculations, prompts the 
user to indicate first, for which DBMS the sizing estimates will be made, and second, the type ot calculation, based 
on the availability of information about the database. The Sizer currently supports estimation of mass storage 
requirements for SQL Server 6.5, SQL Server 7.0 and Oracle 8.04 databases. Also, estimates of mass storage 
requirements can be made, based on two levels of customer knowledge of the database, i.e., 

• Estimate: Little is known about the database 

• Detailed: Aggregate sizes of rows and indexes are available by table 



Basis for Mass Storage Requirement Calculations 



DBMS ^ : 
<* SQL Server 6 ,5 
C SQL Server 7.0 
C Oracle 8. OX 



Type of Calculation 
^Estimate} 
C Detailed *■>' 



Continue 



Cancel 



Comments 



Click Continue to proceed to the next dialog window. In the following two sections the dialogs for the two levels 
of calculations, Estimate and Detailed, are described. The example database used is based on the Pubs Database 
that ships with SQL Server. The examples used are for the SQL Server 6.5 DBMS. 

3.3.1 Estimate Calculation 



The estimated case is based on the user providing six items of information, shown in the following table. 



Input Item 


Description 


Number of Tables 


Total number of tables in the database 


Total Amount of Data, GB 


Total estimated size of the raw database, in gigabytes 


Average Number of Columns/Row 


The average number of columns per row for all tables in the database 


Average Row Size, Bytes 


The average size of a row in a database table, expressed in bytes. 


Percent Variable Length Columns 


The percentage of columns for all tables that are variable in length, 
e.g., varchar. Defaults to 15%. 


Average Size of Var. Length Columns 
per Table, Bytes 


The aggregate number of bytes for all variable length columns in a 
table, averaged over all the tables 
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The dialos used ;o make the estimates is shown below. 



Database Size* [Estimated) - SQL Seivei 6.5 



Input Parameters . - -- 

; • . Number of Tables 

. Total Amount of Data (GB) 

Average Number of Columns/Row . 

Average Row Size (Bytes) 

Percent Variable Length Columns 

Average'Size of Variable Length Columns Per 
; Table (BvtesV 



11 



20.7 



5.8 



81 



| 23 



60 



(Default: 10) 



(Default: 15%) 



; Mass Storage Requirement (GB) . - 

Database Total .. 28.76 

Application and OS Space ^ 1.00 

System Tables 0.10 

Scratch & Sort Space 10.07 

Log file 7,19 

Growth 7.19 



Total Requirement (GB) 



OK 



54,31 



Make Estimate 



! 



Cancel 



Parameters 



After having entered the six parameters characterizing the database, the mass storage requirement estimate is made 
and displayed via a click of the Make Estimate button. We note that the size of the page file is not included in 
these estimates. This is because the pagefile size is a function of the memory size requirement which is a function 
of the application load and CPU requirement. 

Selecting OK or Cancel causes the form to be unloaded. Selecting OK gives the user the choice of saving this 
information to a workbook having the same format as the template Mass Storage Estimate.xlt. For a subsequent 
mass storage sizing, this same workbook can be loaded, the information contained in it is loaded into the sizer 
database and corresponding form. Note also that selecting OK keeps the sizing information in the sizer database 
so that in a subsequent sizing, during the same sizer session, this information is loaded into the form. Selecting 
Cancel causes the sizing information to be cleared. 

In order to use the few parameters listed above to make the mass storage requirement estimates, certain additional 
assumptions are required in the areas of indexing and other space requirements. The indexing assumptions are 
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based on experiences in the field with various client databases. The space assumptions when using the SQL Server 
DBMS are based on Microsoft's recommendations tor SQL Server 6.5 and Windows NT 4.0. The space 
assumptions when using the Oracle DBMS are based on Oracle recommendations as well as Unisys experience 
with its proprietary RDBMS. 

A dialog is provided to show not only the default parameters values, but also to allow the user to change the values 
to whatever may be more applicable to the application being considered. This dialog, which is accessed via the 
Parameters button, is shown below. Note that the rightmost column contains values that can be modified. Also, 
the user can easily revert to the default values via the Use Defaults button. Further information on parameters is 
given via the Comments button. 



Estimate Assumptions for SQL Server 6.5 



Parameters Used in Calculating Mass Storage Requirement 



Parameters used in both Estimated and Detailed Cases 

PageSize, Non-settable -~ ; 

Fill Factor " 
Log File Space; per cent of formatteddatabase size, including indexes , 
Temporary Space; per cent of formatted database size, including indexes 
OS and application software, GB 

System databases, GB : .-, ; '•• . 

Per cent growth in database 

Pagefile space; per cent of memory size 



Assumptions used only in Estimated Case 

Average number of non-cfustered indexes per table 
Average number of Fixed Length fields per non-clustered index 
Number of Variable length fields per non-clustered index 
Average number of cluster indexes per table 
Average number of fixed length fields per cluster index 
Number of variable length fields per cluster index 



Default 
2048 
85 
25 
35 
1 

' 0.1 
25 
110 



Default 

0.3 
1-2 
0 

1 

1.3 
0 



OK 



Cancel 



Use Defaults 



Current 
2048 

•95 

' 25 
• . 35 
1 

. S 0.1 
. 25 
110 



Current 

0.3 
1.2 
0 
1 

1.3 
0 



New' Unit 
. B 



33 



25 



35 



0.1 



25 

I 110 % 



New 



0.3 



1.2 



1.3 



Comments 



% 
% 
X 
GB 
GB 
% 



The above dialog has the same appearance for each sizer supported DBMS. The set of default values for SQL 
Server 6.5 and SQL Server 7.0 is the same with the exception of the page size which is not modiiiable (2048 for 
SQL Server 6.5 vs. 8192 for SQL Server 7.0). For Oracle, the parameters are as shown in the corresponding, 
following dialog. Note that Oracle's indexes using a B-Tree always have a leaf page level. Consequently, Oracle 
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does not have an equivalent "cluster" key, as defined for SQL Server. Addtionally, note that the sizer supports ^ 
only size estimates for indexes characterized via B-Tree algorithms. Thus, the calculation of size requirements tor 
cases of Oracle's "clustered tables and indexes 1 ' and "hash indexes" are not supported by the sizer. 



Estimate Assumptions for Oracle 8- OX 



Parameters Used in Calculating Mass Storage Requirement 
Parameters used in both Estimated and Detailed Cases 



DB.Bbck.Si2e (1 024, 2048, 4096/81 92) 
100 * PCTFREE 

Log File Space (Redo log); per cent of formatted database size, including 
Temporary Space (support sorts, 'index ; creation. etc.); per cent of formatted 
OS and application software, GB ; . 
System databases (system tablespace), GB ' 
Per Cent growth in database • 

Pagefile space; per cent of memory size ; " / 

INiTRANSJ). • •: • ^ ■ ■ v 

INITRANS.X . : - . '■. ... .'v. ; , ;.■ 

Assumptions used only in Estimated Case 

Average number of indexes per table 

Number of Fixed Length fields per index ... 

Number of Variable length fields per index 



Default 


Current 


New 


Unit 


,4096 


4096 . 


4096 


B 


90 


90 ; 


30 


% 


25 


25 . , 


25 


Z. 


35 


35 ■ 


35 


X 


1 


; 1 


1 


GB 


0.1 


0.1 


0.1 


GB 


- 25 


' 25 


25 : 


• X 


• : 110 


110 ~ 


110 


% 


1 


. 1 : 


1 




. 2 


V; ■'? : ■, 


2 




Default 


Current 


New 




13 


1.3 


1.3 




1.3 


1.3 ... , 


1.3 




0 


. o 


0 : 





OK 



Cancel 



Use Defaults 



Comments 



3.3.2 Detailed Calculation 

The Detailed case is based on the user providing information about the size of the rows in each table and index 
the database as well as the amount of space that should be made available for new rows or updating rows. This 
information includes the following: 
• For each table 

- Number of fixed size columns 

- Total fixed bytes per row 
Number of varchar columns 
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Total varchar bytes per row 
N T umber of rows 

- Fill Factor (SQL Server) or i 00 - PCTFREE (Oracle) 
• For each index (or clustered index for SQL Server case) 

- Number of fixed size index columns 

- Total fixed bytes of index columns per row 

- Number of varchar index columns 

- Total varchar bytes o f index columns per row 

- Fill Factor (SQL Server) or 100 - PCTFREE (Oracle) 

The dialog used to provide this data to perform the calculations is shown below. If the user had selected New 
sizing and a previous, detailed had been completed during the same sizer session, or, if the user had selected 
Previous Sizing and loaded a workbook with detailed sizng results, then the dialog would be loaded with the 
description of the database as is shown below. Otherwise, the form would be empty. 



Database Sizer (Detailed) • SQL Set vet 6.5 



Record Type 



IT 



*[ i i r 



"Table Name Index Www *' Record Type 



' - Ftxed 
No. No.- 
Cob Bytes 



Varchar 
No. No. X Fill 
Cots Bytet Factor 



No. 
■ Of 
Row 



NO; 

Pages 



No. Storage . 
B : Tree Space; 
Levels (MB J 



authors 

authors 

authors 

discounts 

employee 

employee 

employee 

job* 

jobs 

pub_info 
pub_inlo 
oubfishers 



au id 
AK1 



emp id 
AK1 

iobjd 

pub_id[FK) 



Save end Continue 



Data 5 
Clustered Index 1 



Index 
Data 
Data 



Clustered Index 1 



Index 
Data 

Clustered Index 1 



Data 



1 



Clustered Index 1 
Data . . . 2 



Cancel 



31 

11 

0 

16 

25 

9 

1 

4 

2 

4 

4 

5 



120 


95 


23.000.000 


2.555.556 




0 


95 




22.032 


5 


GO 


95 




963.310 


7 


40 


35 


3,000.000 


111.112 




50 


95 


43.000.000 


2.150.000 




0 


95 




16.166 


4 


50 


95 




1,544.541 


7 


50 


95 


14.000.000 


500.000 




0 


95 




1.853 


4 


305 


95 


8.000.000 


2.000.000 




0 


95 




9.571 


4 


90 


95 


.8.000000 


500.000 





Parameter* 



If an "** appear* next to a line in the Storage Space column, the extra 2 torn per page 
added by SQL Server wi not be ava3able because number of rows per page w8 exceed the 
allowable amount. The FiS Factor is changed to 1 00. 



5.233.78 

45.12 

1,973.83 

227 56 

4.403.20 

33.11 

3.163.22 

1,024.00 

3.79 

4,096.00 
19.60 
.1 .024.00 



Mats Storage Requirement*. GB ~~ 

Database Total 41.24 

Application and OS Space 1 .00 

System Tables . 0.10 

: Scratch?, Sort Space 14.44 

Log Be 10.31 

Growth 10.31 

Total Requirement [GB| 77.40 



All input and modifications are made in the upper text boxes and upper buttons of the form. The user first sele< 
the type of entry from the Record Type drop down, as shown in the next dialog. Data is selected if the user is 
going to add the characteristics for a table. Similarly, Clustered Index or Index is selected if the user is going 
describe the characteristics for an index of a previously defined table. 
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Database Sizet (Detailed) SQL Seivei 6.5 



Record Type 



nr 



.Data 

Clustered index 
Index 



i r 



c> gtage 



When a Record Type selection has been made, the text boxes and the Add Record button are enabled, as shown 
m the next dialog. " The user then supplies the information in each text box. Once the data has been supplied, 
clicking the Add Record button will cause this entry to be added to the list box and the appropriate calculations to 
be made. 

Note that when there are more than one table defined, the Table Name textbox becomes a drop down combo box 
listing all of the tables. This is useful when adding an index to a table. 



Database Sizer (Detailed) - SQL Seivei 6.5 



Record Type 



Table Name 



Data 



jj| New _T able | 



Lfojd Record, l| 



Wo. 



Fixed Varchar' ' No.. 

No. No. No. ■ No. >BI 01 , 

Cds .Bytes Cofc Byles • Factor .Rows 



j 12 | 250 | 3 fli | 95 | 2000000 
Clear [ ' V 



r 



. No. . 
8 Tree 



Modfications can also be made to existing entries. This is done by first selecting the entry in the list box, as shown 
below. 



Database Sizer (Detailed) • SQL Server 6.5 



Record Type 



Table Name 



Index Name 



Fixed 
No. No. 
Cols Bytes 



[ Clustered Index *]\ 



Resubmit 



| aujd 
Delete 



I 1 
Cle3r 



11 



Varchai 
No. No X Fill 
Cols Bytes Factor 



Table Name Index Name Record Type 



Fixed Varchar No. 

No. No. No. No. *Fill Of 
Cols Bytes Cols Bytes Factor Rows 



95 



No. Storage 
No. B-Tiee Space 
Pages Levels (MB) 




4 -» n/vt rtftn 



This causes the entry to be displayed in the first line of the enabled text boxes. After modifications are made as 
necessary in the text boxes, the user clicks Resubmit to make the list box entry change and to perform the 
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necessary recalculations. Selecting Delete will cause the list box entry to be deleted; note also that if yoi 
deleting the information for a table, then :he corresponding index information is also deleted. Seiecring 
cause the text boxes to clear and to de-select the list box entry. 



The output from each entry provides the following information: 

• The mass storage requirement for each table and each index 

• The number of B-Tree levels associated with each index. This is the number of pages (SQL Server) or blocks 
(Oracle) that must be read from the disk and/or memory (cache) in order to access the first item of data. 

Additionally, the box in the lower right corner provides summary information about the mass storage 
requirements. Note that this summary information is in the same format as that for the Estimated case. 

Selecting the Parameters button will open the same dialog as the one shown for the Estimated case. Selecting 
Save and Continue or Cancel, will unload the form. If Cancel is selected, the information in the form is cleared. 
If Save and Continue is selected, the user is then solicited to save the results to a workbook having the same 
format as the template Mass Storage Estimate Detailed.xlt. 

Note also that selecting Save and Continue keeps the sizing information in the sizer database so that in a 
subsequent sizing, during the same sizer session, this information is loaded into the form. This capability allows 
the user to switch from a detailed sizing for one DBMS to a detailed or estimated sizing for the same or different 
DBMS. Similarly, the user can switch from an estimated sizing to an estimated sizing for the same or different 
DBMS; however switching from estimated to detailed with the same information is not possible. 

3 A Using the NT Sizer 

Currently, the Enterprise Sizer for Windows NT Applications estimates configuration requirements for OLTP 
workloads. A TPC-C workload may be selected, or the user may define his own. For the TPC-C workloads, one 
may also compare the resulting tpmC estimates to those of existing systems. 

3.4.1 TPC-C Workloads 

To size a TPC-C workload, select OLTP System Sizing, TPCC Workload from the Sizer Menu. The dialog 
shown below is then displayed. 
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Eotetprrte Serve* Configuration Estimator tot TPC-C Workload* 



Client: 



Your Client's ridme 



Operating System 

^Windows NT 4.0 
C Other 



Baseline System 

Database Software 

C Informix 
f^Orade 7.2< 
Oracle 8 >X 



f* SQLSeiveiSS 
r -SQLServer 7.0 
C DB2 



(~ Sybase 
f Other 



System 



. Processor 



Mhz Wo. CPUs tpmC 



Pnce/Peff Date 



Acer s Altos 19000Pro4 c/s 



Pen t i u» Pro--' 2 



200 



L1072 



28 



Enterprise Server Target System 

(Windows NT 4:0 OS. SOL Server 7 Q DBMS) 



- Input Data — 

Enterprise Server 
Max. Ptocesioi Utilization. % 
tpniC Requirement 
Elective TPS. 



QS/2 



3 



80 



140C0 
. 530 



-Predictions — • : — — — — — ~ 

CPU Requirement 4 

Effective CPU Uiifization, % - , ' 77. 

Merrory Requirement. MB 4096. 

Mass Storage Requirement. GB 979 

No of. Users Supported 11296 

tpmC Ratio (Target to Base) - 1:26 



Calculate 



The top half of the dialog provides the information from the TPC-C Comparator to allow you a basis or baseline to 
which you can compare your projected target system estimates. The lower left quadrant of the dialog allows you to 
select the target system and to specify its performance characteristics. 

Estimated and measured TPC-C performance data is available for NT servers running the SQL Server 7.0 DBMS 
on the more recent and future systems; these systems include the XRJ6 with up to 12 processors, the QS/2 with up 
to 4 Xeon processors, and the to -be -announced QS/2 follow-on (FO) which will be offered with configurations up 
to 8 processors. The "target" systems for sizing purposes comprise these systems. Older systems are not included 
in the set of target systems. 

For the target system to be sized, you must specify the following as indicated in the lower left quadrant of the 
dialog: 

• The specific server from the choices of XR/6, QS/2, and QS/2 FO 

• Maximum processor utilization 

• tpmC requirement 

The maximum processor utilization is the processor utilization level that you do not wish to exceed with the 
specified workload on the proposed system. Specifying a maximum of 100% is not recommended, as response 
limes degrade as the processor utilization approaches 100%, and it also provides no room for growth. Specifying a 
value too low will provide configuration requirements that far exceed the input requirements. If you do not have a 
processor utilization number in mind, use 80-85% as this will provide a reasonable estimate with a safety margin. 

Since this type of sizing is based on a TPC-C workload, you must also specify the tpmC requirement. If you do not 
know what tpmC value to use, start with a baseline system and increase or decrease the tpmC value accordingly. 
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Once the baseline and target system characteristics have been selected, click the Calculate button. Results oi the 
calculations are shown in the lower right quadrant of the dialog. 

For example, in the dialog above, the Unisys QS/2 server with the Xeon 400 Mhz processor is being compared to 
the Acer 4 way system with the 200 Mhz processor. The tpmC requirement was input as 14000 on the QS/2 
server with a requirement that the processor utilization not exceed 80%. This results in a configuration requiring 4 
processors that will operate at 77% utilization on the average. Further, this system provides 26% more throughput 
than the baseline. Also shown are the estimated processor and memory requirements of 4096 MB and 978 GB, 
respectively. 

Note that when you enter or change a tpmC value, the corresponding value for Effective TPS changes 
automatically. This number represents the total number of transactions per second that could be realized for the 
target system with the specified tpmC value. The tpmC value represents transactions per minute for TPC-C New- 
Order transactions, where New Order transactions represent approximately 45% of the total transaction workload. 



3.4.2 User Defined Workloads 

This section will show you how to use the Sizer to estimate system configuration requirements for a user-defined 
OLTP workload. It will guide you through the steps required to define the database and the application workload 
characteristics. 

3.4.2.1 Previous vs. New Sizing 

Select OLTP System Sizing, User Defined Workload from the Sizer Menu to size a user-defined OLTP 
workload. You are then presented with the New Sizing options window shown below. You have the option of 
performing a totally new sizing or continuing from a sizing which was previously saved to an Excel workbook. 
Whether you select New or From a previous sizing, you will be lead through a three step process which will 
produce a system configuration estimate. 

If you select a previous sizing, you will be queried for the workbook name via the standard Excel Open File dialog. 
We note that if you select a file not previously saved via a sizing, or altered since it was saved as sizing results, the 
sizer does not necessarily recognize this; consequently, the results are unpredictable. The workbook selected must 
have the same formatted worksheets as those in the template UserBk.xlt. 




(-Select source. of input data"*;* 



From a previous sizing 



OK | 



Cancel | 
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3.4.2.2 Hardware Selection 

The main dialog from which the multi-step sizing process functions is shown below. One firs: selects the system 
type and the desired backbone LAN speed. The various systems from which to select are the following: 

• XR/6 with 200 Mhz Pentium Pro processor 

• QS/2 with 400 Mhz Xeon processor 

• QS/2 with 450 Mhz Xeon processor 

• QS/2 follow-on (FO) with 400 Mhz Xeon processor 

• QS/2 FO with 450 Mhz processor 

A corresponding maximum processor utilization is also specified. 

We note that some of these systems are not as yet available; however, there is available data from which we can 
estimate system requirements. Thus, this provides some predictive capability for our future systems. 

The various LAN speeds from which we can select are the following: 

• 10 Mbit Ethernet 

• 1 0 Mbit Switched Ethernet 

• 100 Mbit Ethernet 

• 100 Mbit Switched Ethernet 

• 1 Gbit Switched Ethernet 

• Best Fit 

Selecting Best Fit allows the Sizer to determine the smallest LAN speed that will satisfy the expected LAN traffic 
at or below an optimal, maximum utilization. The optimal utilization is dynamically entered and displayed with 
the Network Interface Card maximum utilization whenever the corresponding LAN speed is selected. 
Additionally, this value can be overridden by the user. For example, selecting a 100 Mbit LAN speed, the 
maximum, optimal utilization is considered to be about 35% which is entered as the Network Interface Card 
utilization; the user can then optionally override this value using the corresponding spinner. 
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OLTP Usei Defined Woikload 



Client: (Your Client's Name 



Hardware Data 

Processor Type 



1400 Mh2 OS/2 



Backbone LAN 5peed ! 10 Mbit Ethernet 



* Maximum Utilizations, % - ™ • - 

: Processor 

Network Interface Card 



85 V 

35 



Ej 
~3 



r Make Selection 



(? step 1 :~ Estimate Mass Storage Requirements 
C step 2: Define OLTP Application and Workload 
C Step 3: Estimate Configuration Requirements 



r i 

r •! 



Perform Function } Return to Menu | Instructions | 



Having selected the hardware, the next two steps are to estimate mass storage requirements and define the 
application and workload. Selecting the hardware and these two steps can be done in any order. To estimate mass 
storage requirements, select the radio button corresponding to Step 1, and click the Perform Function button. 

3.4.2.3 Estimate Mass Storage Requirements 

The process to estimate mass storage requirements was described in Section 3.3. The process described in that 
section is identical to this portion of the system sizing process. 

Selecting Step 2, we can now define the application and its workload. 

3.4.2.4 Define OLTP Application and Workload 

The user is taken to a worksheet from which he performs a series of steps to define the application and the 
workload. A transaction consists of a series of SQL statements surrounded by a BEGIN TRANSACTION and a 
COMMIT. 

If you are conducting a new sizing, you will be presented with a blank worksheet that looks like the one shown 
below. You must use the Workload menu options to define the number and content of the transactions. This 
process is described in the following sections. 
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Workload Definition 



Status I 



Txn No. 



Txn Name 



Txn/Sec % of Total 



Description 



If you are using a previous sizing, the worksheet will already contain the workload definition as defined during the 
last sizing. 

3.4.2.4.1 Workload Definition Process 

A worksheet, shown below, is processed using the Workload menu which consists of the following five items used 
to define the application and the workload. 



Menu Item 

Instructions 

Add Transactions 
Delete Transactions 
Transaction Composition 
Return 



Description 

Provides a set of online instructions to guide you through the workload definition 
phase. 

Adds the specified number of transactions to the application. 
Deletes the specified transaction from the application. 
Allows you to define SQL statements for a specified transaction. 
Return to the three step sizing process main menu. 



A recommended sequence is the following: 

► Specify how many unique transactions comprise the application: From the " Workload" menu, selecting 
"Add Transactions" will allow you to add a user supplied number of transactions to the application. The 
added transactions are highlighted via shading. 

» Generically identify each transaction by name: Fill in the transaction name column, and provide a short 
description, if desired. 

► Specify each transaction's load on the system: Fill in the expected rate in the transaction per second (TPS) 
column. 

► Specify each transaction's composition: SQL inserts, deletes, updates, selects: This is done via a 
subsequent dialog as a result of selecting the Transaction composition button. This process is described in 
detail in Section 3.4.2.4.2. 



Workload Definition 


Txn No. | Txn Name 


| Txn/Sec 


% of Total | Description 


{ Status 


1 New Order 


35 


26.9% 


Defined 


2 Payment 


35 


26.9% 


Defined 


3 Add Publisher 


10 


7.7% 


Defined 


4 Delivery 


35 


26.9% 


Defined 


5 Delete Stores 


15 


11.5% 


Defined 


Totals 


130 


100.0% 





This process is repeated until all of the transactions have been defined. Note that you can both add and delete 
transactions during this phase, thus allowing you to further modify the application and workload definition. Select 
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the "Instructions" menuitem from the Workload menu for extra help on completing the information m this 
worksheet. 

3.4.2.4.2 Composition of OLTP Transaction 

Recall that each transaction can consist of several SQLs. To activate the transaction composition dialog shown 
below, highlight a transaction in the worksheet and then select the Transaction Composition menuitem from the 
Workload menu. In this dialog we can now specify the various types of SQLs comprising a given transaction. We 
can add an Insert, Delete, Update, or Select SQL by selecting a radio button in the Add a SQL group. Selecting 
one of the four SQL types causes a corresponding menu to be displayed in the Edit SQL Parameters group 
located in the lower half of the dialog; each of these menus is also shown in the dialog below. A default SQL 
Name is also entered with its suffix randomly generated to guarantee uniqueness. You can use the generated name 
or liighlight it and enter your own name for the selected SQL statement. 

Choosing an Insert, Delete, or Update command will highlight the corresponding Number of SQLs item. Use the 
spinner to specify the number of Insert or Delete SQLs, or the number of records Updated. 

For the Select statement, use the highlighted spinner to specify whether it is a single table Select, or a nested join 
from either two or three tables. In ail cases, the assumption is that the Selects are indexed, which is consistent 
with OLTP applications. For all Select cases, the Selectivity Criteria must also be completed. For a Select from a 
single table, the selectivity refers to the number of rows selected from the table. For a two-table join, the selectivity 
refers to the number of rows selected from the outer, or left, table; the inner table selectivity is assumed to be an 
average of four for each row selected from the outer table. For a three-table join, the selectivity is equal to the 
number of values in the WHERE clause that pertain to the outer table. For further clarification on how each of the 
select cases may be applied in a specific OLTP application, click the corresponding Show the SQL button and 
refer also to section 4. 

Additionally, for the Select statement, the number of columns and aggregates per row returned are requested. 
These values together with the number of rows returned are used to estimate the amount of traffic on the LAN per 
SQL and transaction. The number of columns per row are not currently used to calculate processor usage. 

For each SQL added, you must save the changes by clicking the Commit Changes button. This action causes the 
newly created SQL to be added to the Current SQLs list box. You may also modify your definition of each SQL by 
selecting the SQL from the list box. This causes its definition to be highlighted in the Edit SQL group. Upon 
completion of editing, commit the changes. 
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Add A SOL 

\C Qefaie [ - 
| ^Update! - 
\r Select j - 



Composition of Transaction No. 2: Payment 

Current SQLa 



Ad J A SQL - 




*~ [insert] 


C Update 


C Delete 


C Select 



1 



SCLJJpe 5t * _v *' tr cl> £ e 

SOL_Lpeeie_0ttt.*is: 
SGL.SdJbstntt 

S'X_Selccl_CuTt ^ I 




Return to Previous Dialog 



Edit SQL Parameters 



SOL Name | SGLJnsert.SNP 
of SQL? 

Edit SQL Parameters 



Commit Changes I - CaneeJ 



i 



SOL Nsme [ SOC_Oeitrte,ATV 
Number of SOLS ' 1 

- Edit SQL Parameters- 



Commit Changes 



Cancel 



(Ave Record* ' Delete • 4) ; 



SQL Mams | SQL_up«e_*u Commit Changes | /Cancel 
Records per SQL 1 • 



<Sinc*e Table) 



, Edii SQL Parameters^ 



' SQLName | SGi_Seteci_SBK 

Matter of TaUes 

Selectivity Criteria 
Row* Selected 



Commft Changes \ 



(Indexed Ort^) 



Cancel 



Rows 



i 



Columns 



Column Info 



Show (he SOL 



Once you have defined this transaction, select Return to Previous Dialog. This takes you back to the worksheet 
where you can select another transaction for further definition. As noted previously, you can always return to this 
dialog for further definition of a selected transaction. Continue this process until all transactions are defined. 

3.4.2.5 Estimate Configuration Requirements 

Once all of the transactions have been defined as in section 3.2.2.2, we now have enough data to estimate the 
configuration requirements. Select the radio button for Step 3 and click Perform Function. Note that the Status 
box must be checked for both steps 1 and 2 in order to proceed with the third step. 
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The sizer now estimates the configuration requirement. An example of the format of the results is shown in the 
following dialog. Input requirements are shown above the horizontal bar. The items below the bar are calculated 
by the Sizer. 



Enterprise Server OLTP Sizing Results 



13 



System Requirements 



/ 



Workload 

Database Software 

System 

Processor 

Txn/Sec Requirement 
Max CPU Utilization, % 



CPU Requirement 
Effective CPU Utilization, % 
Memory Requirement, MB . , 
Mass Storage Requirement, GB 
Communications Requirement 
Communications Bytes/sec 
Communications Utilization % 



Modify Requirements 



OLTP User Defined 
SQL Server 6.5 
Unisys QS/2 
4Q0MhzXeon 

130 

85 ' "' 



\ 



2 

54 .. .. 

1024 

78,5 

10 Mbit Ethernet 

432/393 

34 



Detailed Results j 



Return to Menu 



3 



The CPU Requirement indicates the number of processors required to support the defined workload; the 
processors are of the type specified by the System field. The Effective CPU Utilization is the estimated average 
total processor utilization for the specified workload. The Memory Requirement is given in megabytes (MB). 
The Mass Storage Requirement, given in gigabytes (GB), is the total amount of disk space required to support the 
given application; this includes following: 

• Formatted database size 

• Windows NT operating system, the DBMS, and application files 

• DBMS system tables 

• Scratch and sort space 

• Transaction log file 

• Paging file 

• Growth 
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The t>pe of ethernet card required to support this workload is indicated as well as the expected amount oi LAN 
traffic along with the expected ethernet card utilization. 

The buttons at the bottom of the dialog provide you with several options. Click Modify Requirements to return to 
the sizing main menu, where you can make changes to the database size or transaction workload assumptions and 
then have the Sizer recalculate the system configuration requirements. The current results will not be saved. 



3.4.2.6 Saving Results 

The Detailed Results button will save the sizing results onto the corresponding worksheets used in the 
UserBkw.xls workbook. The user is then prompted to save the resulting workbook to disk. The names and 
contents of the resulting worksheets are listed in the table below. A sample of each of the results worksheets is 
provided in the remaining sections. 



Worksheet Name 


Contents 


Mass Storage Sizing - Summary 


Summarizes the mass storage sizing. Includes input parameters on a database 
basis, summary output data, and the additional parameters used to make the 
calculations _ . 


Mass Storage Sizing - Details 


If a detailed sizing was done, then this worksheet contains the per table details 
entered via the mass storage dialog. 


OLTP Input - Workload 


Contains the defined transaction workload. 


OLTP Input - Transactions 


Contains the defined transaction composition for each transaction 


OLTP Sizing Results 


Overall results of the sizing. 


OLTP CPU Load 


Bar chart depicting relative CPU utilization for each transaction in the mix. 


Comin Load 


Bar chart showing relative usage of the ethernet interface card for each 
transaction . . 


Capacities - CPU 


Chart showing estimated peak transaction rates for mu It 1 -pro cess or 



The Return to Menu button will return you to the Sizer "home page". If you press this button without having 
exercised the Detailed Results option, the Sizer will not retain the sizing just completed. 



3.4.2.6.1 Worksheet Results: Mass Storage Sizing 



The estimated mass storage requirements are placed in one or two worksheets, depending on the type of mass 
storage sizing conducted. If the mass storage sizing was of the Estimated type, then the input parameters and 
results are placed in a worksheet called Mass Storage Sizing - Summary. If the mass storage sizing was of the 
Detailed type, then the input parameters and results are placed in both the iMass Storage Sizing - Summary and 
Mass Storage Sizing - Details worksheets. 

For both the Estimated and Detailed types, the Mass Storage Sizing - Summary worksheet has the format shown 
in the "Mass Storage Sizing Summary" table shown below. The information includes the six input parameters 
specified for the Estimated case, the summary mass storage requirements, and the additional sets of parameters 
required to make the calculations. For a Detailed case the six input parameters are weighted averages of the data 
supplied to the detailed sizing. Note that the mass storage estimates exclude page file size. This is because the 
results contained here were calculated during the database sizing portion of the exercise, and paging file 
requirements are based on total memory requirements. Total memory size was not known until the final system 
configuration estimate was completed. 
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, . . 

Mass Storage Sizing Summary 

SQL Server 6.5 

(Based on Detailed Database Information) 



Summary of Input Data 



Description 


Value 


Number of Tables 

Total Amount of Data, GB 

Average Number of Columns/Row 

Average Row Size, Bytes 

Percent Variable Length Columns 

Averaae Size ofVar. Lenath Columns, Bytes • ' 


11 
20.7 
Su8 
81 
23 
60 


Mass Storaqe Estimates 


Description 


Value 


Database Size, GB 'V r 
Application and OS Space, GB 
System Tables, GB 
Scratch and sort Space 

Growth. ' • . . " • 
Log Files . 


41.24 
1 

0.1 
14.44 
10.31 

.10.31 


Total Mass Storaae Retirement 


77.4 


'Excludes page file 

Assumptions Used in Calculations 


Description 


Value 


PageSize, Non-settable 


. ' • ■ . 2048 


Fill Factor 


95 


Log File Space; percent of formatted database size, including indexes 


25 


Temporary Space; per cent of formatted database size, includinq indexes 


35 


OS and application software, GB 


1 


System databases, GB 


0.1 


Per cent qrowth in database 


25 


Paqefile space; per cent of memory size 


110 



NT Performance Services 



Unisys Corporation 



Copyright _ 



Appendix A. Unisys Enterprise NT Sizer Description and User Guide A-31 



For the Detailed case, the Mass Storage Sizing - Details worksheet has the format as shown in the "Database 
Statistics by Table" table below. This lists the same information 3S that given in the detailed sizing form. 



Database Statistics by Table 

























Storage 








Fixed 


Fixed 


VarChar 


VarChar 


Fill 


Number 


Number 


B Tree 


Space 


Tabte Name 


Index Name 


Record Type 


Columns 




Columns 


Bytes 


Factor 


of Rows 


of Pages 


Levels 


(MB) 



authors 




Data 


5 


31 


120 


4 


95 


/ 23,000,000 2,555,556 




5,233.78 


authors 


au id 


Clustered Index 


1 


11 


0 


0 


95 


22,032 . 


5 


.45.12 ...... 


authors . 


AK1 


Index 


0 


0 


60 


2 


95 


♦ 963,810 


7 


1.973.88 


discounts 




Oata 


4 


16 


40 : 


1 


95 


3.000,000 111.112 




227.56 


employee 




Data 


6 


26 


50 


2 


95 


43,000,000 2,150,000 




.4,403.20 


employee 


emp_id 


Clustered Index 


1 


9 


0 


0 . 


95 


. 16,166 


4 


33,11 


employee 


AKl" 


Index 


1 


1." 


50 


' 2 


95 


1 ,544,541 7 


3.163.22 


jobs 




Data 


3 


4 


50 ' 


1 


95 


14,000,000 500.000 




1.024.00 


jobs 


jobjd. 


Clustered Index 


1 . 


2 


0 


0 


95 


' ' 1 353 


4 : 


' 3.79 ~ 


pubjnfo 




Data 


1 


4 


305 


2 


. 95 


8.000,000 '2,000,000 




; 4,096.00 


pubjnfb 


pubjd(FK) 


Clustered Index 


1 


4 


0 


0 . 


95 


9.571 


4 


19.5 " 


publishers 




Data 


2 


B 


90 , 


3 


95 


8,000,000 500.000' 




1,024.00 


publishers 


pubjd 


Clustered Index 


1 


4 


0. 


." 0 


95 


- • 2,394 


4 


4.9 . 


' publishers 


pub_name 


fndex 


0 .-■ 


0 


40 


1 


- 95 


- ,229',631 


6 


470.28 


roysched 




Data 


4 


18 


'. 0 


0 


. 95 ' • 


86,000.000 924/32' 




1,893.85 


sales 




Data 


4 


20 


32: 


2 . 


95 


.• 21,000,000 724,138 / 




1,483.03 


sales 


st or idord num Clustered Index 


1 


4 ; 


20 - 


1 


•95 


: ; 14,200 .' 


5 


29.06 


stares 




Data 


3 


11 


80 \ 


'."3".. 


95 


8,000,000 375.000 




768 


stores 


stortdl 


Clustered Index 


1 


4 1 -. 


q : 


0 . 


95 


- V 1 ,796 : 


4 


3.68 


stores 


storjiame 


Index 


0 


0 


40 


1 


'95 


172,223 :. 


6 


352.71 


titleauihor 




-Data 


4 


22 


o : 


0" 


95 


.25,000.000 324,676 




' 664.94- ; 


title author 


FK 


Clustered Index 


2 . 


; 17 


0 


0 


95 


- - 3,866 .- 


4 


7.92 


titles 




Data 


8 


.54 


280 


2 


95 


18.000,000 6,000,000 




- 12,268.00 


titles 


•■ title jd 


. Clustered Index 


1 ' 


' 6' 


0 


0 


95 


- . . 35,089 


5 


•71.86 


titles 


title ' 


. Index 


0 .'. 


0 


80 . 


1 


95 


956.250 


7 ; . 


1,958.40 
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3.4.2.6.2 Worksheet Results: OLTP Input -Workload 

The worksheet OLTP Input - Workload is created during Step 2 of the sizing process. It contains the specified 
transaction names and rates, and also a description for each one. The Status column on the right indicates that the 
transaction content was defined during the second step. 



Workload Definition 


Txn No. | Txn Name 


| Txn/Sec 


% of Tola 1 1 


Description 


I Status 


1 New Order 


35 


26.9% 




Defined 


2 Payment 


35 


26.9% 




Defined 


3 Add Publisher 


10 


77% 




Defined 


4 Delivery 


35 


26.9% 




Defined 


5 Delete Stores 


15 


11.5% 




Defined . 


Totals 


130 


100.0% 







3.4.2.6.3 Results Worksheet: OLTP Input - Transactions 

The worksheet OLTP Input - Transactions shows the composition of each transaction as defined during Step 2 of 
the sizing. 



Transaction Composition 



Txn No. 



SQL No 



SQL Name 



SQL Type 



.... 2Sel. 

1Sel_Oistrict 
- SQLJJpdate 
3GLJnsert_Qrders 

SQLJnsert 
SQL.SelJ jtem 
SGl_SeMJtem 

SQLJSeM 
SQL Sel 1 stock 



Sel-2 Tables 
SeM Table 
Update 

Insert 

Insert 
SeM Table 
.SeH Table 
SeM fable 
■SeM -Table 



SQLs/| 
Txn 



el ectivitj Columns 



1 JQLSelecljiistric SeM Table 1 

2 3L Select OL-stoi Sel-2 Tables 1 



Legend: 



SQL Type 


SGLsTxrl 


Selectivity 


Insert 


n 


1 


Delete 


n 


4 


Update 


1 


n 


Sel - 1 Table 


1 


n 


Sel - 2 Tables 


1 


n 


Sel • 3 Tables 


1 


n 
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3.4.2.6.4 Results Worksheet: OLTP Sizing Results 

The worksheet OLTP Sizing results contains the same results as were displayed in the System Requirement 
report dialog. When the data is in worksheet format, it can be printed using the standard Excel printing optic 

. — ■ : : — — — 

Enterprise Server Sizing Results for OLTP User Defined Workload 
Client: Your Client's Name 



System Requirements 



Workload 


OLTP User Defined . 


Database Software 


SQL Server 6.5 


System 


Unisys QS/2 


Processor 


400 Mhz Xeon 


Txn/Sec Requirement 


100 


Max. CPU Utilization, % 


. B5 . ; - ■;, 


CPU Requirement 




Effective CPU Utilization, % 


'46. ■■ 


Memory Requirement, MB 


■> 1024 ?: - 


Mass Storaqe Requirement, GB 


78.5 


Communications Requirement 


, 10 Mbit Ethernet 


Communications - Bytes/sec 


393,798 


Communications Utilization % 


31 
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3.4.2.6.5 Results Worksheet: OLTP CPU Load 

The bar chart in the OLTP CPU Load worksheet shows what proportion of the CPU load is attributed to each 
transaction. In the sample chart shown above, transaction 1 accounts for approximately 20% of the total workload, 
transaction 2 accounts for 70%, and transaction 3 is approximately 10%. 



o 



100% 



80% 



0% 



*S 60% 



% 40% 

ZL 
O 

5 20% 



Relative CPU Load 



3 4 
Transaction No. 
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3.4.2.6.6 Results Worksheet: Comm Load 

The bar chart in the Comm Load worksheet shows what proportion of the communications load is attributed to 
each transaction. In the sample chart shown above, over 90% of the load is attibuted to transaction 2. 



100% 



80% 



o 



*S 60% 



S 40% 

c 

o 

£ 20% 



0% 



Comm Load 



3 4 
Transaction No. 
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3.4.2.6.7 Results Worksheet: Capacities - CPU 

The Capacities - CPU worksheet contains a table showing the estimated peak transaction rate that can be 
achieved for each processor configuration based on the defined application workload. 

The sample table shown below contains the estimates for the same sizing as used in previous examples. It shows 
that a maximum 243 transactions/sec could be achieved on a single processor system running the specified 
application workload. The original sizing estimate showed that the specified transaction rate of 31/sec could be 
achieved on a single processor system with an e fleet ive CPU utilization of 13%. 



— — — — 1 • — — — 

Estimated Enterprise Server Capacities 


Client: Your Client's Name 




Estimated Peak Transaction Rate (T ransactions/Sec) 




No. Processors 


. Capacity @ 100% CPU 






1 - 


113 






2 ■ 


219 






" . ' . 3 - 


320 






• ■■ . 4 


416 




Syst 


em Data 




Workload 


OLTP User Defined 






Database Software 


SQL Server 6.5 






System 


Unisys QS/2 






Processor 


. 400 MhzXeon 






Txh/Sec Requirement 


100 









4. Measurement Basis for User Defined Workloads 

Several measurements were taken to obtain processor timings and 10 counts for various SQL types which were 
selected based on their generic relevance to an OLTP application environment. Each of the generic SQLs are 
described in the following sections along with citing example business cases where applicable, 

4.1 Selects 

Three Select examples were used: single table, and two and three table nested joins. Format for each example is; 

• A generic SQL. Where applicable for the nested joins, the lowest table number represents the outer table; and 
the highest, the inner table. 

• A description of the keys/indexes 

• Access method used by the RDBMS 

• The selectivity based on the results of the measurements 

• Applicable business case examples 
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The cases are as follows: 

• Single table selects 

SELECT tl.a, tl.b 
FROM 1 1 

WHERE tl.a in (vl, ... , vn) 
ORDER BY tl.b 

Index description: 
SQL Server: 

PRIMARY, CLUSTERED KEY: 
1 1 : b, c 

NON-CLUSTERED INDEX: 
tl: a 

Access method: via tl.a 

Example Business Cases: 

Find how many orders for a given day and the products sold. 
Find number of customers in a given area. 



• Two table jo in 

SELECT tl.a, t2.a, t2.b, t2.c 

FROM tl, t2 

WHERE 

tl.a IN (vl, ... , vn) AND 

t2.a = tl.a 
GROUP BY tl.a, t2.b, t2.c 
ORDER BY tl.a, t2.c, t2.b 

Index description: 
SQL Server: 

PRIMARY, CLUSTERED KEY: 
tl: a 
t2: a, b 

Access method: 

tl is outer table 

tl is accessed via cluster key 1 1 .a 
t2 is accessed via cluster key t2:a,b 

Selectivity: 

For each row selected from 1 1 , an average of four rows are selected from t2. The selectivity specified 
in the sizer is the number of rows selected from 1 1 . 

Example Business Cases: 

Find all the suppliers that supply a specific product to determine the best price. There would be on the 
average four suppliers per product. 

Find all the airlines that fly to a certain city and determine the best price. 
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• Three table join 

SELECT tl.b, t3.b, tl.d 
FROM tl, t2, 13 
WHERE 

t2.b = xyz' AND 

t2.a = tl.c AND 

t3.a = tl.a AND 
tl.b in (vl, ... , vn) 
ORDER BYt3.b, tl.b 

Index description: 
SQL Server: 

PRIMARY, CLUSTERED KEY 
tl: a 
t2:a 
t3: a, c 

NON-CLUSTERED INDEX 
tl: a,b 

Access method: 

Nested table order (outer to inner): tl, t2, t3 
tl is accessed via non-clustered index tl:a,b 
t2 is accessed via primary, clustered key t2:a 
t3 is accessed via primary, clustered key t3:a 

Selectivity: 

For each value of tl.b (see the where clause), approximately 5 rows in tl match the criterion. For 
each row selected from tl, a row in t2 is selected 20% of the time; for each row selected from t2, an 
average of 3.9 rows are selected from t3. The selectivity specified for the sizer is the number of 
values tl.b specified in the where clause. 

Example Business Cases: 

Find the length of time required to complete customer orders in a given market segment. 
Find the actual arrival times compared to the scheduled arrival times of flights to a given city. 

Note: Using the first example above, the measurement would consist of determining the status of orders placed 
on certain dates from a selected segment of the customer population. For the database used in the 
measurements, the customer segment chosen places about 20% of all of the orders, and each order consists of 
approximately 4 items on the average. In the sizer, the selectivity requested is equivalent to the number of 
orders of a specific kind. 

4.2 Insert 

Each transaction consisted of 10 inserts and was followed by one commit. A measurement consisted of 100 
transactions. 

4.3 Update 

Each transaction consisted of 10 updates and was followed by one commit. A measurement consisted of 1 00 
transactions. 



NT Performance Services 



Unisys Corporation 



Copyright 



A-40 Appendix A. Unisys Enterprise NT Sizer Description and User Guide 



4A Delete 

Each transaction consisted of 10 deletes and was followed by one commit. A measurement consisted of 100 
transactions. 

5. Methodology 

The goal in the sizer's design and development has been to provide a user friendly software tool that will facilitate 
assisting the customer in determining an optimal system configuration that will meet the customer's application 
and workload needs. This sizer is focused on determining system requirements for customer applications that will 
run on the Unisys Enterprise Servers, using the Microsoft Windows NT operating system. 

The NT sizer was first developed as a means to quickly estimate server scalability for the TPC-C benchmark. 
Added to that capability was the ability to quickly compare TPC-C and TPC-D measurement results across 
vendors. This version allows the user to define a database, OLTP application, and workload which are used to 
estimate configuration requirements for NT Servers and for applications that use the SQL Server or the Oracle 
DBMS. The Sizer's capabilities will increase with each new version. 

A major focus of activity in a development of this type of tool is the defining and conducting of certain key 
measurements whose results lend themselves well to estimating, i.e., predicting, the resource usage, and 
consequently, the resource requirement, for specified applications. This section discusses not only the nature of 
these measurements but also the application of these measurements to the prediction methods within the sizer. 
Discussion is scoped to the sizer's current capabilities: TPC-C and user defined OLTP workloads. 

5.1 TPC-C Workloads 

For a TPC-C workload, the intent is to determine the number of processors and amount of memory and mass 
storage required to support the transaction rate where the database size grows with the transaction rate. 

The estimates were taken from TPC-C benchmark measurement results obtained from the Mission Viejo 
performance lab in addition to estimates based on additional measurements. The additional configuration values 
were obtained via curve fitting and prior knowledge of the behavior of SMP systems. 

5.2 OLTP User Defined Workloads 

Each transaction in an OLTP application consists of some mix of SQL" inserts, deletes, updates, and selects. 
However, it is assumed that any query, i.e., SQL select must be of short duration to satisfy the business requirement 
of quick response times. Thus, each select to a single table or to a group of tables (nested join) should be via 
indexed access. Accordingly, three generic, indexed select statements were chosen to represent scope of queries in 
an OLTP environment. This sample is shown together with example business cases in section 4. 

The measurements for these examples were taken on an Enterprise Server, 4 processor, 200 MHz system with l 
GB memory in a client/server computing environment. Additional measurements were conducted on the Aquanta 
XR/6 and the QS/2 systems. The resource measurements were of the server only in processing the request. We 
note that the equipment provided at the network and the client can significantly affect performance; however, the 
scope of the measurements was on the server only. Impact of the client workstation on performance is highly 
dependent on the equipment used and was not considered for these measurements. Calculations for network traffic 
were based on assumptions about the data traffic generated from each SQL. 

The database used for the measurements was tuned, via indexing, etc., to run the candidate OLTP type SQLs. Two 
database sizes, 1 00 MB and l GB, were used for the purpose of verifying measurement results and methodology. 
For each measurement, the primary results used in developing parameter relationships were: processor usage, SQL 
logical TOs, SQL scans, and selectivity (for the queries). 
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For the SQL selects, several selectivity cases were run in order to determine a relationship between resource usage 
and selectivity. Also, each SQL was subjected to the SQL server "Show Plan"; results of Show Plan were then 
compared for similar SQLs to ensure that similar SQLs were always run in the same manner. 

For the insert, delete, and update measurements, a sequence of 10 SQLs was executed before a Commit. The cost 
of a Commit was included in the data used. For deletes, an average of 4 records were noted deleted per delete 
SQL. Thus, the user of the sizer must take this into consideration when defining the application and workload. 

From these measurements we were able to determine parameter relationships which allow us to predict system 
resource usage for similar SQLs defined by the user of the sizer. 

The phenomenon of increased transaction service time due to an SMP environment was also factored into the 
configuration estimates. 



NT Performance Services 



Unisys Corporation 



Copyright 



